Dynamic Properties of Random 
Heteropolymer with Correlations in Sequence. 

E.Sh.Mamasakhlisov & V.F.Morozov 
30.09.98 



1 



Abstract 



The some dynamic properties of a random heteropolymer in the con- 
densed state are studied in the mode coupling approximation. In agreement 
with recent report a dynamic friction increasing is predicted for the random 
heteropolymer with power-law correlations in comparison with exponential 
correlations. In the case of strong power-law correlations (f3 > (3*) the dy- 
namic friction function diverge in the thermodynamic limit. The qualitative 
explanation is given for the monomer's dynamics slowing down and diverged 
energetic barrier between the frozen and random coil states. The possible 
relations with protein's function and evolution are discussed. 
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1 Introduction. 



From the molecular point of view the prebiotic evolution is the creation and 
subsequent change of the set of sequences of proteins and nucleic acids from 
the primordial soup. The one of important features of biologically functional 
proteins is the ability of recognizing an a priori given target molecule. One 
of possible mechanisms of such properties origin was suggested in [1,2]. It 
is the fast polymerization of monomers in presence the target molecule. It 
was shown that this procedure produced the slight power-law correlations 
in sequence of the forming macromolecule due by the interactions between 
target molecule and monomers. It was shown also that similar type of cor- 
relations observed in sequences of real proteins. This result was obtained 
[1] by averaging over the ensemble of different sequences of the homological 
proteins. At the last years the some interesting ideas were suggested that 
try to explain the possible ways to genetic code origin [3,4]. For example, a 
correspondence between nucleotide and amino acid coding units, may have 
origin from the sequence-specific interactions between abiotically synthesized 
polynucleotides and polypeptides [4]. This point of view is in interplay with 
mechanism, proposed by Pande et al.[l]. 

The role that protein's and random heteropolymer's sequence correlation 
properties play in their structure and function was investigated previously in 
[5,6]. In particularly, it was shown [6] that in random heteropolymers ( RHP 
) the power-law correlations produced the energetic barrier between folded 
and random coil states. This result was interpret as an evidence of power-law 
correlations only in non-coding sequences of DNA, that is corresponded to 
results of Peng et al. [7,8 ]. 

Purely dynamic theories in mode coupling fashion were derived previously 
in [9-12] for completely random heteropolymers. . So, Roan and Shakhnovich 
[10] derive a mode-mode coupling theory of the RHP and explicitly solved 
it for a polymer in a good solvent. Thirumalai et al. [11] derived dynamic 
equations in the framework of dynamic functional integral ( DFI ) method for 
a Garel, Leibler and Orland model [13]. The objective of the present study is 
random heteropolymer's dynamics treatment in presence of sequence power- 
law correlations. We have considered here the Langevin dynamics of RHP 
with correlated sequence in mode-coupling approximation. The main goal 
of the present article is the investigation of some dynamic properties of the 
condensed state of RHP. 



3 



2 Model and Formalism. 



Let us consider the heteropolymer chain with a frozen sequence of different 
monomers. On the microscopic scale thermal motion in macromolecule pro- 
duces small deformations of valent bonds and angles and causes rotations 
around single bonds. It is widely believed [14] that at the low temperatures 
in the long macromolecules the long- wave fluctuations are dominant. Thus, 
the global properties of the polymeric chain defined by collisions not merely 
between monomers, but rather between parts of the chain. It was shown [15] 
that interactions in the polymeric chain may be qualitatively represented as 
interactions in the cloud of " quasimonomers" with the renormalized charac- 
teristics of interaction. 

Thus, we can assume that the state of the polymer chain described by 
coarse-grained beads coordinated "^(s) and their momenta ~j?(s) and also by 
the those coordinates ~r* M and ~p* M that describe the precise local structure 
of a polymer. 

Specifically, we examine the monomer interaction energy of the form 

H = E + K + H mc (1) 

where K is the kinetic energy. The U mic is the potential energy for interac- 
tions of microscopic coordinates with other degrees of freedom. E(r) is the 
coarse-grained "beads-on the string" Hamiltonian 

Td N 

E (r) = ^2 I> - r *+i) 2 + E B ss'Krs - ry) + V(r) (2) 

s — 1 s<s' 

where B ss > = Ba s a s ' is second virial coefficient describing two-particle in- 
teractions. {<t s } are variables, describing species of monomers. V(r) is the 
excluded volume interactions approximation by effective confinement term 

V(r) = B*J2rl (3) 

s 

where the constant £?*chosen so that the radius of gyration becomes the 
value, required by the monomers packing density in the globular state [16]. 
B ss > is the effective second virial coefficient of the interaction between parts 
of the chain. B ss > represents the collective property of all the monomers of 
the chain. Taking into account the central limit theorem, we can choose {<j s } 
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as random variables, characterized by Gaussian distribution function in the 
form 

-i(V,x-V)j (4) 



P {a s } oc exp 



where 7? = {cr s }. K is the matrix describing correlations in the chain se- 
quence 

K{(t s ,(t s >) =o7o7 ( 5 ) 

Following the generalized Rause theory, derived by Hess [17,18] for the 
entangled polymeric liquids, consider the dynamic equations, described ran- 
dom heteropolymer behavior. 

Let us a complete set of degrees of freedom is {X. (t)}. Then the equation 
of motion for X. (t) is 

jX.it) ^ -LX.it) (6) 



where L is Liouvillian given as 

N 



s=l 



v(s)V r{s) - ^-V r(s) HV v{s) 

fit 



-£(V^ V rM # mic Vj (7) 



m 



vis) and are velocities of beads ( segments )and of microscopic con- 
stituents, m and m M are the corresponding masses. The projection oper- 
ator formalism shows that Liouville equation can be reduced into the form 
of generalized Langevin equation [17,18,19] 



d 
It 



i 

r(s,t) — —T ^2 f dt'fjL a9l (t-t') W r^",t')+ 9 (s,t) 

1 11 *J 



(8) 



s's" 



where gis,t) satisfied to fluctuation dissipation theorem 

(g(s,t)g(s',t')) = T» M ,(t-t') (9) 
7 S ' S " are the elements of the nearest-neighbor matrix for an open ideal chain 



d 

T ' " = JO 

Is s _.9 . 



2 ss 



(10) 
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As it was shown in [18] the dynamic mobility function fullfil the following 
equation 

Co Co J 

where A( s " s '(r) is the dynamic friction function produced by force correla- 
tions between beads ( coarse-grained coordinates ) 



Tp( s ) = -^r{s)Hi n t 



(12) 



where H int = £ s<s ' B ss ,5(r s - r a >) 
It was shown that [18] 



AC'W = P(f P (s)e-^ t f p (s))=f3{f p (s)e-"f p (s))+ (13) 
+P E / dT(7 p (s)e-^ir(s'))A( s/ ,AT) 



s"=l 

The two-particle interaction forces given as 

7 P (s) = -Vr(s)H int = -\b\T I tte- l ^ r(s) a s m(-lf) 



where 



m 



(14) 
(15) 



and 



(If ) = ^ Try (if) = a s' ex P [-^^(s') 

s s 

r r d d q 
J q "' = J j2nY 

are defined by analogy with the bead's collective density fluctuations. Thus, 
the dynamic friction function is equal to 

E< 



A C SS '0) = ~\b\ T tl^Y,( m s(T)m Sl {-T)e Lt rn s/ {-f)m S2 {-^))- 

J 1 J P Sl S 2 

"I&IE / r^^^K(Y)m sl (-Y)e-^( t -)^(/))AC s " / 



si 
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The four-point correlation function at the right side of the last equation can 
not be calculated rigorously. Following [18] choose as the simplest approxi- 
mation for collective density fluctuations 



t,t) = £ R ss > (?, t)m, (?, 0) = £ R ss' (t: t)m, (?) (16) 



The relaxation functions R ss >(~cf,t) are defined by expression 

(m s ( ~ct,t)m s >(- if)) =Y,R ss "(~?i t )( m s"(~?) m s'(-~?)) 

s 

In this approximation 

A£.'(f) = -\b\ 2 T [ f??J2J2R s , s „(?,t)R S2S3 (-?,t) 

JqJp *i»*s"s 3 

(m s (?)m Sl (-?)m s >> (~j?)m S3 (-?)) - 

-\b\ f / t dr( l f)^K(f) msi (-f) e A(/))AC / ,(r) 

J 1 J Sl 

It was shown that [18] 

|2 



AOW = -\b\ 2 T ttET,Rs's"(t,t)R S2 s 3 (-t,t) 

Jq Jp 

{m s (-cf)m Sl (-?)m sll (-f)m s . A (-^?)) (17) 

At the integration over if the expression 

(m s (?)m si (-lt)m 8 » (~f)m S3 (-if)} 

gives nonzero contribution only when beads Si and s are in contact. However, 
the contact between si and s have influence on the contact between the beads 
s 3 and s , if they belongs to subchain (si — s) ( see fig.l ). The loop formation 
between si and s sufficiently reduced the available conformations space as 

{Number of subchainfsi — s) loop conformations} ._£_ 

■ oc \s 1 - s\ 2 (18) 



{ Total number of subchain (s 1 — s) conformations } 
(In the case of ideal Gaussian chain d* = d ). 



7 



By this reason, the loop formation between s\ and s have increased the 
probability of contact formation for the beads belongs to subchain (si — s). 
Taking into account this approximation 

A( ss ,(t) = -\b\ 2 T [ E R s > s »(?,t)R S2S3 (-?,t)x 

JqJp s ^s" S3 eis, Sl ] 



( 



a^a^e^^e^^) = ~\b\ 2 T fY, E R s 's"(t,t)R S2S3 (-t,t)x 

Psvs 2 s '' S36[SiSl] 

Jq 

where 

I (ssi) = ~r*(s) — ~7*(si) 
The last integral can be evaluated as [18] 

( / -f^e-^+^iss^ „ _ p 2 I (e -4t (ssi)) 

Jq JQ 

Thus, 

A C/(0 — \b\ 2 T I E E a s a Sl a^a S3 f (e-^ r ^) 
J P S1S2s " S3e[s!Sl] J Q 

fp 2 R s > s »(?,t)R S2S3 (-t,t) (19) 



Because of the integral over is dominated by the large p modes only 
correlations between small numbers of beads have contribution. Then we 
assume that in the limit of large wave vectors the correlation between different 
beads vanishes and only single bead propagator survives 

/ p 2 R s > s »(t,t)R S2S3 (-t,t) » <W'*«2«3 / P 2 G 2 (t,t) 

where G(~p*, t) is the single bead propagator. 
Consequently, dynamic friction function is 

AC lu M = \b\ 2 T*„,(l?) fi?(?Or,t) (20) 
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where 

E E <T.a n a.>a n J ^7^) 

si>s s 2 e[s,si] Q 

Thus, the generalized Langevin equation for the beads coordinates 

j t r(s, t) = ~ £ |* diV M '(* " f'KvKs", t) + <?(s, t) (21) 

s s 

where 

(<7(M)> = 

(g(s,t)g(s,t'))=TTti M ,(t-t') (22) 

= - r f drJ2^At - t) AC,» At) 

Co Co J Q 

The dynamic friction functions A( ss '(t) are defined by eqn.(20). In spite of 
the case investigated in [18], the A Sjj i are the elements of the nearest-neighbor 
matrix for the open chain with the confinement interactions (3). 

Let us estimate the magnitude of the dynamic friction function in depen- 
dence of correlations in 7? = {<7i, a N }. Value of the \& ss /("c?) is defined 
by the summation over large number of beads and formally similar to the 
energy with corresponding interaction potential. Thus, \& ss ' ("a*) is the exten- 
sive quantity. It's reasonable to suppose that \I/ ss ' (V) is the self-averaging 
quantity. In this case 

=*^J=E E W 1 ^'^/(e- < ^ (Ml) ) (23) 

si>s' S2G[s,si] ^ 

The main object of our interests is the condensed state of random heteropoly- 
mer. It is well known that in this state any subchain behaves as ideal and 
one-time ( equilibrium ) correlations described by Gaussian statistics. The 
deviation from the Gaussian statistics produced a great entropy lost by order 
~ N. Consequently, 

^ e -*Q I («si)\ _ e -Q 2 b 2 \s- Sl \ 

where b is the length of the effective statistical segment. Thus, 
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where 



= E E ^^^e-^l— 1 1 

si>s' S26[s,Sl] 



E E OaOa^s'Oa 



-Q 2 b 2 \s- Sl \ 



si>s' s 2 e[s,si] 

Because of variables "c? are Gaussian distributed and 07 = 0, then 
°sO"siOV o" S2 = & s a Sl ■ & s >cr S2 + a s a s > ■ a Sl a S2 + o^o^ • cr Sl a s > 
Q mode of the ty ss > (er) may be estimated as 

= E TO7e"° 262|s_sl1 E ^+ 



Sl>S 



Sl>S 

+ E 



Sl>S 



<7 S <7 'e 



) 2 b 2 \s- 



ll E ^1^2 + 

S2e[s,Si] 

Q 2 6 2 |s-si| E 

S2€[s,Sl] 



(24) 



(25) 



It is obvious that in dependence of rate of correlation function decay the 
magnitude of & SS >(Q) may be quite different. 

First of all, let us consider the exponential decay of inter monomer's 
correlations. In this case 



K(s, s) oc exp 



s — s 



(26) 



For simplicity, we have restricted our consideration by the case s ~ s . Then 
the main term, contributed to $ ss ' ( Q ) is 



^r Q2&2 i s - Sl i 



s±>s 



E a S!°S2 = 
S26[s,Sl] 



-Q 2 b 2 \s- Sl \ 



E a *i 



a 



A' 2 



Sl>S 



S26[s,Sl] 



It is obvious that for the exponential correlations 

S2€[s,Sl] 
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is finite for any range of summation. Consequently, 

lim < oo 



N — >oo " ss 

for any value of Q. Moreover, 

id/2 

si>s' S2€[s,si] «i>s 



is finite in the thermodynamic limit ( N >> 1 ). In spite of exponential 
correlations, the power-law decay leads to the following picture. Suppose 
that 

K(s,s) oc s-s' (27) 

Then, 



AT M 



^(3) = ^E^ 26Vsi1 E ^-^E^ Q2b2M El^-^l 

si>s s 2 e[s,si] M=0 S=0 

(28) 

It is known that [20] the last sum may be asymptotically estimated as 
for M > 1. Consequently, 

N 

Thus, 



6 2 M 

1 vi c 
M=0 



<MT?) = / $^(g) ~ rtf £ M^ 2 (29) 



N 

'Q M=0 

In the thermodynamic limit *& ss '( 0*) diverge if 

/?>/?* = d/2 — 1 (30) 
In the case of three-dimensional space the "critical value" of (3 is 

/?* = \ (31) 
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Thus, the magnitude of dynamic friction function is strictly dependent from 
the type of correlations. For the exponential decay the dynamic mobility is 
more much than in the case of log-range ( power-law ) correlations. Moreover, 
for the strong (/3 > (3*) power-law correlations the long heteropolymer chain 
(N — » oo) exhibits a zero mobility in the globular state. 

This situation is similar to the case of strong entanglements, described in 
[18], where 



3 Discussion 

The subject of present article is investigation of relation between the protein's 
sequence and its dynamics. 

In the case of power-law correlations the dynamic friction behavior is 
qualitatively distinguished from those in the heteropolymer with short-range 
correlations. While, for the exponential correlations the A( ss '(t) is finite in 
the thermodynamic limit, in the case of power-law correlation (f3 > 1/2) 



(N ^> 1). Consequently, the dynamic mobility of chain segments in the 
globular state sufficiently decreased with the range of correlations increasing. 

This result give us an ability to explain the possible mechanism of large 
energetic barrier existence between frozen and random coil states that was 
shown in [6]. The long-range correlations in sequence must be increased the 
probability of advantageous contacts for the neighbor and especially non- 
nearest neighbor monomers of chain that effectively slowing down the mutual 
slip of the chain parts ( see fig.2). It may be explain the so strong depen- 
dence of dynamic mobility from the range of correlations. The equilibrium 
properties of the RHP defined by averaging over the all chain conforma- 
tions. For the long-range correlations in the chain sequence the extended 
regions of similar monomers are dominated. In the globular state, character- 
ized by monomer's position fluctuations, the averaging over all conformations 
included also a very disadvantageous contacts between extended regions of 
different monomers. This produce a great energetic barrier for the RHP with 
power-law correlations. 




o 



(32) 
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What may be the constructive role of such correlations in sequence? 

One of the necessary properties of biologically functional protein is it 
ground state stability, provided by the energetic barrier existence between 
native and misfolded states. The long heteropolymer chain have a very low 
mobility in the state of disordered globule and denaturation/renaturations 
processes are very difficult in this state. But the real protein's single chain 
length in most cases don't exceed ~ 10 2 amino acids. It is due not only by 
the possible errors of replication from the very long segments of RNA. It 
may be related with necessity to have the stable native state. It is obvious 
that the random coil may be transferred to the folded ( native ) state only 
through the disordered globule state. It's due by the low entropy of the 
Gaussian chains in this state in comparison with those in any other state of 
the polymeric chain [14]. The decreased mobility of disordered globule state 
should be stabilize the native state of the system. 

The results, described above may be relevant to those, obtained by Pande 
et al. [1,2]. The power-law correlations in sequence can additionally stabi- 
lized the frozen state of the heteropolymeric globule. 
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Figure 1. 

The contact formation between the chain residues s and si increased the 
probability of the internal residues (s and s 3 ) contact formation. 
Figure 2. 

Schematic representation of the mutual movement of the chain parts in 
the presence of the long-range correlations, z are the disadvantageouse con- 
tacts between different monomers and x are the advantageouse contacts be- 
tween similar monomers. 



15 



